Robust connected word speech recognition using weighted Viterbi algorithm and context-dependent temporal constraints
Abstract
This paper addresses connected-word speech recognition with signals corrupted by additive and convolutional noise. Context-dependent temporal constraints are proposed, compared with ordinary temporal restrictions, and used in combination with the weighted Viterbi algorithm, which had been tested in isolated-word recognition experiments in previous papers. Connected-word recognition tests show that the weighted Viterbi algorithm depends on the accuracy of the state duration modelling, and that the approach covered here can reduce the error rate by as much as 90 or 95% at moderate SNR using spectral subtraction, an easily implemented technique, even with a poor noise estimate and without any information about the speaker. It is also shown that the weighting procedure can reduce the error rate when cepstral mean normalization is also used to cancel both additive and convolutional noise.
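The abstract names two standard front-end techniques: spectral subtraction for additive noise and cepstral mean normalization for convolutional noise. The following is a minimal sketch of both in their textbook form, not the paper's implementation. The function names, the list-of-lists frame layout, and the `floor` parameter are assumptions made for illustration. Cepstral mean normalization works because a fixed linear channel appears as an additive constant in the cepstral domain, so subtracting the per-utterance mean removes it.

```python
def spectral_subtraction(noisy_power, noise_power, floor=0.01):
    """Subtract a noise power estimate from each frame, bin by bin.

    noisy_power: list of frames, each a list of power-spectrum values.
    noise_power: one list holding the estimated noise power per bin.
    floor: hypothetical flooring factor (an assumption, not from the paper);
           it stops the result from dropping to zero or below.
    """
    cleaned = []
    for frame in noisy_power:
        cleaned.append(
            [max(p - n, floor * p) for p, n in zip(frame, noise_power)]
        )
    return cleaned


def cepstral_mean_normalization(cepstra):
    """Subtract the per-utterance mean from every cepstral coefficient.

    cepstra: list of frames, each a list of cepstral coefficients.
    """
    n_frames = len(cepstra)
    n_coeffs = len(cepstra[0])
    # Mean of each coefficient over all frames of the utterance.
    means = [sum(f[k] for f in cepstra) / n_frames for k in range(n_coeffs)]
    return [[f[k] - means[k] for k in range(n_coeffs)] for f in cepstra]
```

In a real recognizer the noise estimate would typically be averaged over non-speech frames. The abstract notes that the reported gains hold even when that estimate is poor.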
Similar resources
Temporal constraints in Viterbi alignment for speech recognition in noise
This paper addresses the problem of temporal constraints in the Viterbi algorithm using conditional transition probabilities. The results here presented suggest that in a speaker dependent small vocabulary task the statistical modelling of state durations is not relevant if the max and min state duration restrictions are imposed, and that truncated probability densities give better results than...
Context-dependent word duration modelling for robust speech recognition
Conventional hidden Markov models (HMMs) have weak duration constraints. This may cause the decoder to produce word matches with unrealistic durations in noisy situations. This paper describes techniques for modelling context-dependent word duration cues and incorporating them directly in a multi-stack decoding algorithm. The proposed model is capable of penalising duration constraints of a wor...
Weighted Viterbi algorithm and state duration modelling for speech recognition in noise
A weighted Viterbi algorithm (HMM) is proposed and applied in combination with spectral subtraction and Cepstral Mean Normalization to cancel both additive and convolutional noises in speech recognition. The weighted Viterbi approach is compared and used in combination with state duration modelling. The results presented in this paper show that a proper weight on the information provided by sta...
Robust speech recognition based on Viterbi Bayesian predictive classification
In this paper, we investigate a new Bayesian predictive classification (BPC) approach to realize robust speech recognition when there exist mismatches between training and test conditions but no accurate knowledge of the mismatch mechanism is available. A specific approximate BPC algorithm called Viterbi BPC (VBPC) is proposed for both isolated word and continuous speech recognition. The proposed...
Applying word duration constraints by using unrolled HMMs
Conventional HMMs have weak duration constraints. In noisy conditions, the mismatch between corrupted speech signals and models trained on clean speech may cause the decoder to produce word matches with unrealistic durations. This paper presents a simple way to incorporate word duration constraints by unrolling HMMs to form a lattice where word duration probabilities can be applied directly to ...
Publication date: 1999